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Abstract. Let A be an n x n random matrix with iid entries taken from the p-adic integers 
Zp or Z/iVZ. Then under mild non-degeneracy conditions the cokernel of A has a universal 
probability distribution. In particular, the p-part of a random matrix over Z has cokernel 
distributed according to the Cohen-Lenstra measure, 

1 °° 

P(coker^ = G) = ' ^^'^ ^ ^^""^'"^ 

where the constants are absolute. 



1. Introduction 



The Cohen-Lenstra measure is a probabihty distribution on isomorphism classes of finite 

abehan p-groups, given by 
O . ^ oo 

I It appears in various guises as a candidate for a random abehan p-group. Its best known 
appearance is in the conjectural work of Cohen and Lenstra [2j, where based on numerical 
evidence they conjectured that the p-part of the ideal class group of quadratic number fields 
are distributed according to u. These conjectures were extended to other number fields, in 
particular by Malle 0, as long as the number field does not contain pth roots of unity. 
' It was observed in [3] that if i? : — )• ZJJ is a random matrix with iid entries chosen 
^ ■ according to Haar measure, then the cokernel distribution of B converges to the Cohen-Lenstra 
measure as n — )■ oo. In this article we show that this property holds for random matrices in a 
Q I much larger class. 

■ Let ^ G be a random variable. We define the concentration constant a = a (^) to be the 
largest value < a < 1 such that we have the non- degeneracy condition 



sup = t mod p) < I — a. 



If y4 G M{n, Zp) is an iid random matrix whose entries have concentration constant a, then we 
say that A has concentration constant a as well. 

We have the following universality principle for random matrices with iid entries in the p-adic 
integers. 

Theorem 1.1. Let A G M{n,'Zp) be a discrete random matrix with concentration constant a. 
Then for all finite abelian p-groups G, 



^ oo 

P(coker A = G) = - p-') + 0(e 



—can\ 



where the constants are absolute. 
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We can also control the probability distribution of the cokernel of matrices over other rings. 
Let N denote a positive integer with u = uj{N) distinct prime factors. Let B : (Z/A^Z)" — >■ 
(Z/A^Z)" be chosen uniformly from all Z/A^Z-module morphisms and let A be a random n x n 
matrix over Z/A^Z with iid entries distributed according to a random variable C,- We define the 
concentration constant a{^) analogously to the p-adic case; namely, < a < 1 is the largest 
number such that 

sup sup P(x = t mod p) < I — a. 

p\N tez/pZ 

Then we have the following universality result. 

Theorem 1.2. Let A e M{n,'L/N'L) he a discrete random matrix with concentration constant 
a. Then for all finite TLjNTj-modules G, 

P(cokerA = G) = P(coker 5 = G) + 0<^(e-^"'') 

where the constants depend only on u = u{N), the number of distinct prime factors of N. 

As a corollary, we have the following generalization of Theorem 1.2 from [7J to control the 
rank of random matrices over Z/pZ for primes p. 

Corollary 1.3. Let A G M(n,Z/pZ) be a discrete random matrix with concentration constant 
a. Then we have 



,-fc2 n£=fc+i(^ ~ p 
nli(i-p-^) 



P(rankv4 = n - k) = p-k' Lli^i±ll £^ + 0{e- 



for all < k < n, where the constants are absolute. 

For now let i? = Zp or Z/A^Z and let Xi, ...,X„ denote the columns of A. The main idea 
behind Theorem 11.11 and 11.21 is to expose the columns one by one and compute the probability 
distribution of the successive quotients 

where {Xk+i, ■■■,Xn) denotes their span as -R-modules. Conditioning on the isomorphism class 
of the previous quotients, we can partition i?" into sets according to the resulting class of 
the quotient by Xk- It turns out that these sets can be written as the set-theoretic difference 
of i?-submodules of i?" that are constructed in a natural way from the span (X^+i, 
Modulo some simple technical restrictions, it suffices to consider the probability that X^ lies 
in the random submodule ^((Xfc+i, X„)), where is a natural deterministic function that 
takes -R-submodules of i?" to -R-submodules of i?". We will call such (f){{Xk+i, X„)) enlarged 
submodules. 

The analysis of enlarged submodules relies on the swapping lemma of Tao and Vu [10] pT| 
which the author subsequently adapted to finite fields in [7|. This allows us to show that 
enlarged submodules N where P(Xfc G N) deviates significantly from IX-*"!"^ can be induced 
with much higher probability by a different probability distribution. 

For the remaining submodules, we still have P(Xfc G X) deviating slightly from uniform. In 
this setting we generalize the inverse theorem of [7j to show that X-*- contains a "structured" 
vector. Since we can assume that this probability is within a constant factor of IX-*"!"^, it is 
possible to explicitly enumerate all such vectors. 

The key property of A used in the proof is the martingale independence of its columns. 
It should be possible to prove a version of this theorem with much weaker conditions on the 
construction of A. However the independence of the entries is crucial to the argument and it 
is unclear how to avoid this obstruction with current technology. 

Although we only work over i? = Zp and Z/XZ in this article, the argument is amenable 
to much more general analysis. We do not attempt maximum generality to avoid unnecessary 
complexity for the most common applications. It would, however, be interesting to extend 
Theorem 1 1.1 1 to the ring of integers of finite extensions of Qp. These arguments suffice when we 
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assume that the probabihty distribution on the entries, taken modulo the maximal ideal m, are 
"non-degenerate" in the sense that they do not concentrate on an additive coset of Fp/ = R/m. 
However, it is intuitively clear that it should suffice, at least for bounded exponents /, for the 
probability distribution to not concentrate on affine subfields; i.e. subsets of the form /3Fpd + 7 
for some d \ f. This limitation will be considered in forthcoming work. 

From Theorem 11.21 we can recover the torsion-free results of Tao and Vu [10] as well as the 
mod-p results of Charlap, Rees, and Robbins ^ and Kahn, Komlos fS]. As mentioned above, 
we would need control over rings with non-prime residue fields R/m for a full generalization. 

2. Notation 

We will use standard asymptotic notation. For an index variable n and functions /, g of 
n, we write / = 0{g) to mean that there are absolute constants no and C such that for all 
n > uq, f{n) < Cg{n). These constants may change from line to line. Similarly, the equation 
f = g + 0{h) is shorthand for \ f — g\ = 0{h). We use the notations f ^ g and > / to denote 
f = 0{g) when convenient. 

It is convenient to employ probabilistic notation. If E is an event, we let ¥[E) denote its 
probability; if F is another event, then we can express the conditional probability of on F as 
F{E I F) and their intersection as F{E A F). For random variables X we let KX denote their 
expectation. 

For V an i?-submodule of -R", we define 

V[t] := {veR''\tve V} 

and by abuse of notation 

V[oo] := {v e R''' \ tv e V for some t G R}. 

We let (a) denote the ideal in R generated by a; we will also let (a) denote the principal 
ideal ai?" in the free module i?". For positive integers m, n we will let [m, n] denote their least 
common multiple. 

We will use the number theorist's exponential function e{t) := exp(27rit) and ep{t) : = 
exp{2iiit / p) . 

If A is a set we will let j^A and \A\ both denote its cardinality. 

3. The column exposure process 

Consider the set of n x n matrices over the finite field Fp. It is well-known that the propor- 
tion of invertible matrices in this set can be calculated from the sequence of column vectors 
Xi, . . . , Xn and the associated sequence of subspaces 

We:= (X,+i,...,X„) CF^ 

for £ = 0, . . . , ra; we have chosen to consider subspaces starting with X^ for the most convenient 
normalization. In fact, we see that the entire matrix is invertible if and only if codimlV^ = 
n — dim Wi = i for each i. If we let A denote a random n x n matrix over Fp chosen uniformly, 
so that the columns Xi, . . . , X„ are independent random vectors taken uniformly from Fp, then 
we compute 

P(codim We-i = £ - 1 | codim Wg = £) = 1 - P(X^ G We \ codim We = i) 

= l-p-' 

so, in particular, 

n 

F{A is invertible) = JJ(1 

e=i 

We can generalize this computation to compute the cokernel of a random n x n matrix A 
over the ring R, given by 

cokerA := X„). 
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We consider the sequence of submodules 

as £ = 0, . . . , n. We have the natural quotient map 

with kernel equal to (X^), the span of in Rn/Wi. 

In this section we show that the isomorphism class of these quotients can be computed by 
testing the membership of the column vectors Xi in various submodules constructed in a natural 
way from Wi. We define an enlarged submodule to be a submodule of the form 

n (^)+w,[h] 

where J-" is a finite set of pairs in R x R. Since enlarged submodules are functorial in Wg, we 
will typically denote the enlarged submodule by 4>(We). 

The following universality result forms the heart of the argument. 

Proposition 3.1. Let ^ E R be a random variable with concentration constant a. Let k < rjn. 
Then for every enlarged submodule (plWk), conditioning on the isomorphism class 

R^/(l){Wk) = 

for some R-submodule V , we have 

|P(Xfc G cl){Wk)) - \V^\-^\ < 0(e-^'^") 
Here 0<?7<1, c>0 and the implied constant are absolute. 

The bulk of this chapter concerns the proof of Proposition I3.1[ However it remains to show 
that it suffices to consider enlarged submodules. 

The partial quotients contain information about the syzygies of X^+i, ■■■■,Xn over quotients 
of R. We first observe that for R = Zp we can extract the free part of Zp/Wf almost surely. 

Proposition 3.2. For R = Zp we have F{Xe G We[oo]) = 0(6"^°"). 

Proof. For all L > 0, by Proposition 13. II we have 

r{x, G iy,[p^]) = p-^ + ©(e-'^""). 

Now send L oo and note that Wi[oo] = (^^=1 We[p^]. □ 
Proposition 3.3. For R = Z/A^Z we can factor 

R-/W, = ®p\nRI^/{W,)^p) 
where R(p), (We)(p) denote the p-th part of R, We respectively. 

Proof. This is immediate from the Chinese remainder theorem. □ 
Proposition 3.4. With probability 1 — 0(e~'^""') there are isomorphisms 

R^'/We ^R^®Te 
where Ti is a uniquely defined finite R-module. 

Proof. Case R = Xp. We induct downward on C starting with i = n. For i = n the result is 
trivial with T„ = 0. If the result is shown for i, then it suffices to find a finite Zp-module Te-i 
such that Zp^-*^ © Ti_i = Z^ © Ti/ {Xi). Let Vi, vi be some basis for Z^ and write 

Xi = aiVi + ■ ■ ■ + aeVi + t 

with if: G Tf, this decomposition is unique. By Proposition 13.21 we know that with probability 
1 — 0(e~'^""') at least one of the coefficients ai,...,a£ is non-zero. Suppose without loss of 
generality that the power of p dividing is the smallest among ai, . . . , a^. Consider the short 
exact sequence 
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> © T,/{Xe) © Te/{Xe, vi, > 

where l is the inclusion map on vi, ...,Vi-i and tt is the indicated quotient map. It is easy to 
see that ® Te/ {Xe, vi, v^-i) is finite: each element of Te is torsion, while the generators of 
Zp either map to or are torsion. Let 

:^Zl®Te/{Xe,vi,...,ve^,). 

By the splitting lemma, it suffices to define a map of Z^-modules ip : Z^ © T^/ (Xf) Z^"^ such 
that '4> o bis, the identity on Z^"^. 

Indeed, for w e Z^ © T^/ {X^) choose a representative 

w = biVi + • • • + biVi + t' 

where t' e T^. Now = (3 for some r > and /3 e Z^ ; similarly let = p'^a for some s > 
and q; e Zp . If r > s then e Zp and so there is a unique representative 

w - —Xe e . . . , ve-i) © T^. 

We then let ^/'('u;) be the projection onto (vi, v^-i) of this representative. If r < s, then 
p'^~^be/a£ e Zp and so there is a unique representative of p^~^w given by 

ps-r^ _ ^l^x^ e . . . , ve-i) © T>. 

Furthermore, the coefficients of vi, . . . ,ve-i of this representative must all be divisible by p*^*" 
(this is because generates the largest ideal), so we let ip{w) be times the projection 

onto of this representative. It is clear that the resulting map is linear and commutes 

with multiplication by elements of Zp, as required. 

Case R = Z/A^Z. By the Chinese remainder theorem it suffices to prove the result for N = p^ 
for some r > 1. But this follows from the previous case. □ 

We now have a sequence of finite i?-modules T„, . . . , Tq with T„ = and Tq = coker A. For 
any matrix A (with probability 1 — 0{e~'^"'^)) this sequence is uniquely defined. Therefore, for 
Hn, . . . ,Hq any sequence of finite i?- modules with if„ = and Hq — coker A we can write 

P(coker A = G)= ^ P(coker A = G and Tj = Hj for aU j) + 0(e"'""). 

Hn-l,...,Hi 

Next we will consider what sequences Hn, . . . ,Ho can arise as the torsion parts of the partial 
quotients R^-fW^. 

3.1. Classifying partial cokernels. We can classify finite i?-modules in terms of Young di- 
agrams. Recall that a Young diagram (or diagram) is a graphical display of a partition, 
where to the set [m] and the partition j'l > j2 > • • • > jt > with ji + ■ ■ ■ + jt — m we have 

the corresponding Young diagram with t rows, where in the A:th row there are jk boxes. For 
example, the partition of [7] into 4 > 2 > 1 corresponds to the Young diagram 



For R = Zp, any finite Zp- module T is a finite abelian p-group; by the fundamental theorem of 
finite abelian groups, we know that T is isomorphic to 

T ^ (Z/p^'iZ) © (Z/p^'^Z) © • • • © (Z/p^*Z) 



6 KENNETH MAPLES 

with ji > j2 > ■ ■ ■ ^ it > 0, and this representation is unique. We therefore have a one-to-one 
correspondence between partitions ji > j2 > • ■ ■ > jt > of [logp M] and finite Zp-modules of 
cardinahty M. 

For R = Z/A^Z, if we factor R = (Bpk^^j^Z/p^Z then we have a correspondence between finite 
i?-modules T and tuples of Young diagrams, one for each prime dividing A^, such that there 
are at most k columns in the Young diagram for p if p'^WN. We will call the diagram of T 
corresponding to the prime p the p-diagram of T. 

Recall that the matrix A induces a sequence Tn, . . . ,To of finite i?- modules with probability 
1 — 0{e~'^°'^). It turns out this sequence is strictly growing in the sense that Ti ^ T^_i. 

Proposition 3.5. With probability 1 — 0{e^'^°'"') there is an injection Tg Tg^i. 

Proof. We see that (pg : R"'/Wi — )■ R'^/Wg^i commutes with the isomorphisms from Proposi- 
tion 13.41 to give a map 

(t)r.R^®Tt^ R^~^ ® Tf^^i 

with kernel equal to (X^), the span of the image of Xi in R^ © Ti. 

U R = Zp then by Proposition 13.21 we have Xg ^ Ti with probability 1 — 0{e~'^°'"'). In 
particular (Xi) HTg = {0}, so (pe is injective when restricted to Ti. Since all elements of Te have 
finite order their images must have finite order as well, and in particular must be contained in 
Ti_i. Thus we have 0^ : T^_i as required. 

li R = Z/NZ, it suffices to prove that (T^)(p) ^ (T£_i)(p). This is obvious after changing 
coordinates by the classification theorem for finite abelian p-groups. □ 

For any finite i?-module T, we define the j-rank of T at p to be 

'■= Tankv^{p^~^T/p^T). 

From the correspondence we see that (p) is equal to the length of the jth column of the 
p-diagram of T. It turns out that there is considerable rigidity in the change of (p)(T£) as i 
varies. 

Proposition 3.6. For all j > 1, 1 < i < n, and p we have 

rj,{p){Te) < rj-(p)(T£_i) < rj-,(p)(T£) + 1. 

Furthermore, rj^(p)(T^_i) = rj^(^p){Tf) + 1 if and only if 

{Xe) n p>-\R^ © Te) mod p' = {0} 

Proof. Let 0^ : i?^ © — )■ R^~^ © T^_i be the map from the proof of Proposition 13. 5[ We can 
restrict (pi to the submodule p-'~"'^(i?^©T^) to derive a map cp'^ : pP'^^R^QTf) — )■ p'~^(/2^~-'^©T^_i). 
We can then quotient by pP and note that pPi^R^ © T^) maps to zero to derive a map 

: pi-\R' © Ti)/p>{R' © Ti) ^ p>~\R'-^ © Ti^{) / p> {R'-^ © r,_i) 

This is a map of finite dimensional vector spaces over Fp. The quotients factor over the sums 
so we can write 

: © ip^-'Ti/p^Ti) ^ Fj-i © {p^-'Ti_^/p^Ti^^) 

(p is the composition of two quotient maps so it is surjective. Therefore, by the definition of 
the j-rank, we have 

i + rj{Ti)>i-l + r,{Ti.i). 

The other inequality follows immediately from Proposition 13. 5[ 

For the second statement, note that (p is an isomorphism if and only if the kernel of (p 
restricted to p>~^{R^ © Ti) is contained in p'{R^ © Ti). □ 
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This last proposition characterizes sequences H^, ■ ■ ■ , Hq = cokerA which arise from se- 
quences Tn, . . . , To = coker A. We first recall some standard notation on Young tableaux. 

On a Young diagram A, we define a numbering of A from [n] to be an assignment, for each 
box in the diagram, of an integer from [n]. We then say that A is the shape of the numbering. 
We define a semi-standard tableau of shape A to be a numbering which is weakly decreasing 
along rows and strictly decreasing along columns; for example. 



4 


3 


3 


1 


2 


2 






1 









is a tableau of shape A = 4 > 2 > 1. We will write H/X to indicate that if is a Young tableau 
of shape A. Note that we have swapped the usual convention of increasing rows and columns 
for later notational convenience. Finally, to a tableau H/ A we define the sub-diagram of H 
above i to be the diagram consisting of those boxes of H whose labels j satisfy j > i. For 
example, the sub-diagram of the tableau above at 2 is 



We will denote the sub-diagram of a tableau H at £ by He. 

Young tableaux are a convenient notation for sequences of partial cokernels, as the following 
corollary illustrates. 

Corollary 3.7. Let A be an n x n matrix over R and let A denote the diagram of coker A. 
Then there is a one-to-one correspondence between sequences T„,...,To with Tn = {0} and 
To = coker A which can occur as the partial cokernels of A and tableaux H/X from [n], such 
that Te has diagram equal to H^. 

Proof Fix the sequence T„, . . . ,To. By Proposition 13. 51 we see that the diagrams A^ associated 
to Ti are nested, so define H/ X to be the numbering of A given by the time the box appears in 
the nested sequence. Clearly the rows of A are weakly increasing. Proposition 13.61 shows that 
the columns increase by at most one each step, so there cannot be two adjacent equal numbers 
in a vertical configuration. The converse is obvious. □ 

3.2. Enlarged submodules. We now see that we can expand 

P(coker A = G)= ^ F{Te = He for all i) + 0(6"'^°"). 

H/X from [n] 

Each term in the sum can be expanded into a product by conditional expectation; namely, 

n 

F{Ti = Hi for all ^) = JJ ^(^^ " I " J > 

1=0 

Each term in the product is not yet in a form that we can estimate. In particular, we would like 
to represent the set of Xe such that Te^i has the desired isomorphism class as the set-theoretic 
difference of certain submodules of i?". 

Consider the two-parameter family of events 

Ga,b ■■= {Xeep'^R^ + Welp']} 
with a > 1 and 6 > 0. We have the obvious inclusions 

Ga+l,b ^ Ga,b ^ Ga,b+1 

SO we can define events 

Ejfi ■= Gj,o 

and 

Ej^t '■= Gj-t,t \ Gj-t,t-i- 
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for 1 < j and 1 < t < j — 1. The events Ej^t as t varies can be used to test the j-rank of the 
p-primary part of T as follows. 

Proposition 3.8. The events Ej^q, . . . , -E'j.j-i are disjoint, and the equality 

rj,(p)(T£_i) = rj-,(p)(T£) + 1 
holds if and only if Ej q U ■ ■ ■ U Ejj^i holds. 

Proof We have Ej^t C Gj-t,t ^ Gg-t~s,t+s-i for all s >1, but Ej^t+s = Gg-t-s,t+s\Gg-t-s,t+s-i, 
so the family is pairwise disjoint. 

First we observe that the condition rj^(p)(T^_i) = rj^(p)(T^) + l is equivalent to {Xi)np^~^{R^Q) 
Ti) mod = {0} by Proposition 13.61 so it suffices to show that Ej^ U ■ ■ ■ U is equivalent 

to this event. 

Suppose that {Xi) fl p^~^{R^ © Te) mod p' = {0}. Let t > be the minimum exponent such 
that p*Xe G p^K^ + We, or equivalently, Xi G p'^'^ + ^^^[p*]. If t = then Ej^ holds; otherwise, 
t > and thus p^~^Xi ^ p^K^ + Wg. But this implies that p^^^Xg ^ p^'^K^ + Wi, or equivalently 
Xi ^ pi-'BJ' + Wi[p'-^]. Therefore E^^t = Gj^t,t \ Gj -t,t-i holds. It remains to show that t < j, 
but if t = j we would have shown that p'^^^Xi ^ p'^^EJ^ + which is absurd. 

Conversely, suppose Ej^t holds. If t = then X^ G p^R"- + so (X^) = {0} mod p'R^ + W^. 
If t > then p^-^Xi ^ p^'^R'' + Wi which implies that (X^) np^'^R'' + WiC {p^Xe). However, 
p^Xi G p^' + Wi so (X^) n p^-^i?" + VT^ = {0} mod p^'. □ 

We have therefore reduced the problem of computing the isomorphism class of Ti_i condi- 
tioned on Wi to testing the membership of 

X,Gp"i?" + W^,[/] 

for various a, 6, and p. 

Precisely, we have the following. Fix £ and let S C Z+ denote those indices where rfc(T^_i) = 
rk{Te) + 1. By Proposition 13.21 5* is finite with probability 1 — 0(e~'^""). We define random 
variables Va^ for a > 1 and 6 > by 



a,b 



Ga,0 — Ga+lfl, 6 — 

Ga,b — Ga+l,b — Ga,b-l + Ga+l,b H Ga,b-l, b > 

We can then compute the following. 

Proposition 3.9. Fix £ > 1 and tableau H/X. Let S C denote as above the set of columns 
of H which contain the number i. Then we have the equality of events 

k-l 

{Te = Hi} = l[Y,V''-t,t 

fees' t=o 

Proof By Proposition 13. 8[ A; G S' if and only if E^^ U ■ ■ ■ U E^^k-i holds. 

First suppose S = {k} for some k. In this case we want to compute the event that some 
Gk-t,t holds for some < t < A; — 1 while no other Ga,b holds. By the nesting property of Ga,b 
and the inclusion-exclusion principle this is equal to Vk-t,t- The case for larger S follows by 
induction and the inclusion-exclusion principle. □ 



Note that if the product in Proposition 13. 91 is expanded over the sum, then some of the terms 
will cancel identically. 

The various submodules {p') + Wi[p''] C i?"^^ for j > 1 and A; > form a grid of inclusions. 



{p^) + Wi[p^] = > {p^) + We[p''+^] 
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We abbreviate 

G,,k := "X, G (p^') + 

and observe that 

Ejfl = Gjfi 

and 

Ej,t = Gj-t,t \ Gj-t,t-i- 
Let ?7 : Z+ — 7- {0, 1} be the function such that 

r,{He-i)=r,{He)+v{j). 

We can associate a diagram to rj by considering a grid with indices j > 1, k > such that (j, k) 
is shaded if and only if 7]{j + k) = 0. For example, if supp r] = {1,4} we have the diagram 



To check that T^.i = i/^-i conditioned on T£ = He, we observe the following properties 

Proposition 3.10. (1) At most one of the events Ej^t can hold for j fixed. 

(2) // none of the events Ej^q, Ej^t hold, then no Ej/^s holds for j' > j and s < t. 

Proof (1) Since Ej^t = Gj t \ Gj^t-i, by the inclusion Gj^s C Gj^t-i for s < t the conclusion 
follows. 

(2) This follows from E^/,, C Gj>^, C Gj^t = ^i=oGj,k- □ 

We now show that these two conditions imply that there is a unique minimal enlarged 
submodule which contains the set which induces Hi. 

Algorithm 1. The minimal enlarged submodule (f){Wi) which contains the set which induces 
Hi corresponds to the value of J-" when the following algorithm terminates. 

(1) Initialize = 0, j = 1, i = 0. 

(2) Find the minimal t > i such that {j, t) is clear in the diagram. If none, then stop. 

(3) Set J"' := U {{j, t)} and relabel T to T' . 

(4) Increment j. 

(5) Jump to 2. 

We finally observe that for each (j, t) G J-" with t > 0, it suffices to subtract the probability for 
the enlarged submodule J-'U {j, t — 1}; the correct value is then found by the inclusion-exclusion 
principle. 

3.3. Entropy bounds. For £ > we observe that any non-trivial column would require that 
Xi G (p) -|- Vr[oo] for some prime "p. However, by Odlyzko's lemma, first recorded in [8], this 
occurs with probability ©(e""^"") so these columns can be ignored. 

Lemma 3.11 (Odlyzko). For any fixed subspace V of and random vector X G with 
concentration constant a, we have the bound 

P(X eV)< 

Proof of Lemma \3.11[ Let k denote the codimension of V. We can find n — k coordinates 
r C [n] such that V is a graph over r. If we condition on the coordinates of X in r, then there 
is a unique choice for the remaining coordinates [n] \ r for X &V . Since /i has concentration 
constant a, the probability that each entry of X assumes the required value is bounded by 
1 — a, and the result follows from the independence of the entries. □ 
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For G sufficiently small, i.e. logplGI = 0(1) for all p, Theorem 11.11 and Theorem 11.21 follow 
immediately from Theorem 13.11 and the decomposition into Young tableaux. However, for G 
sufficiently large there are a super-exponential number of tableaux H/X and we require a finer 
analysis. 

Define the weight of a diagram A to be the sum 

„(A) = ^ ^(i±il 

{i,i)eA 

Note that ^^(A) is the sum of the entries of the minimum semi-standard tableau on A. Then 
there are two possibilities. 

Proposition 3.12. If G is an R-module such that w{X) > en, then Theorem \l.l\ holds. 

Proof. Any semi-standard tableau if on A is dominated by the event that \T(\ is increased by 
pM(fc)^ where ii{k) is the number of copies of k in H. This is bounded from above by the enlarged 
submodule (p^'^(^) + iy[oo]. These enlarged submodules form a nested sequence in so it 

suffices to consider those /x where < {\og p)~^rin =: z/(fc) for 7] sufficiently small. Then 
in the product we can factor out the error. 



+ 0(e-^"")) = (1 + 0(e-^"")) W 



=1 



Finally we note that the number of possible labelings controlled by the envelope z/(/c) is expo- 
nential with constant controlled by ?7, so Theorem 11.11 follows with ri sufficiently small. □ 

If w{X) < en, then we can derive a bound on the number of possible diagrams of shape A. 

Proposition 3.13. Ifw{X) < en then there are 0{e~^^"') semi- standard tableaux with letters 
from [n] on X. 

Proof. Let dx{n) denote the number of semi-standard tableaux with letters from [n] on A. Recall 
from |1] that 

, / X TT n + i-j 

where ri{X) is the length of the ith row of A and Cj{X) is the length of the jth column. We can 
bound 

n (r,(A) + c,(A)-j-z + l)> Hic.iXy.) 

(*j)eA j=i 

= exp(^Cj(logCj - l))0(exp(cA/n)). 



Similarly, 



Yl{n + i-j) < exp(^Cjlogn). 

(.id) i=i 



Therefore 



ri 



dx{n) < exp(^9(logn - log 9 + 1)). 
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Now 



log n 



Cj{\og n - log Cj + 1)<Y^ e*(logn - t + 1) 

t=0 j;e*<Cj<e'+i 
log n 

< X^e*(logn - t + I e* < c, < 6*+^}. 



i=0 



Note that 

so that 
and thus 

as required. 



Cl 



en 



i=l 



#{j I e* < c, < e*+i} < e-*/2y^ 



log n 



c^aI"-) ^ exp(^ e*/^(logr;, -t+ l)y/en) < exp{cn^/e) 



t=o 



□ 



We are now ready to prove Theorem 11.11 and Theorem 11.21 

Proof. Let A be the Young diagram for G. If w{X) > en then we are done. Otherwise, we have 
by the calculation above 

P(cokerA = G) = ^P(. ■■) 

H/X 

n-1 

H/x e=o 



0(e- 



H/x e=o 

= P(coker5 ^ G) + 0(6-^=°") 
where the third inequality is by Proposition 13.131 



□ 



We have therefore reduced the probability distribution of the cokernel to computing the 
probability that Xi lies in the expanded span of the previous columns. The remainder of this 
chapter describes how to prove Proposition 13. 1[ 

4. Universality for enlarged submodules 

We will distinguish between four possible types of submodules that (j)(Wi+i) can represent. 
Let S, d, and D be constants to be chosen later. Intuitively, 6 and d will be of order 1/100 
while D will be about 10. 

sparse: There is a non-zero w -L N such that |supp t^l < 6n; here < 5 < 1 is an absolute 
constant. We will show that sparse submodules are represented with exponentially small 
probability by direct counting in Section [61 

unsaturated: is not sparse and 



max e 



-dan 



D 



< 



P(X e A^) 



1 



\N^\ 



Here D is an absolute constant. We will use a generalized version of the swapping 
argument from [TD] , [H] to show that these appear with exponentially small probabihty 
in Section [71 
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semi-saturated: neither sparse nor unsaturated, and we liave tlie inequality 

D 



^—dan ^ 



< 



\N^\ 



\N^\ 

Note that the set of semi-saturated may be empty if \N-^ \ is sufficiently large. We will 
count the semi-saturated submodules by finding a structured w -L N and then directly 
counting to show that they are represented with exponentially small probability. This 
is done in Section [9l 

saturated: is neither sparse, unsaturated, nor semi-saturated. In particular, these 
submodules satisfy the universality property. 

Proposition 13.11 follows if we can show that 4>(We) represents a saturated subspace with 
probability 1 — 0(e~'^°"), where the constants are absolute. 

5. Analytic and Combinatorial Tools 

Before we finish the proof of Proposition 13. we must recall some theory from analysis and 
additive combinatorics. For more information see [5] and [7]. 

Let /i be a measure on a compact commutative topological ring. For any < e < 1 we define 
the spectrum Spec^.^ fi to be the set of Fourier coefficients with magnitude at least 1 — e; i.e. 

The importance of Spec is that it is closed under a bounded number of set additions, as long 
as e is enlarged sufficiently. Recall that for sets A, B in an additive group Z, we define A + B 
to be the sumset {a + b \ a E A,b & B}. 

Lemma 5.1. For all e > and k a positive integer, we have 

Speci_^ /i + ■ ■ ■ + Speci_^ /i C Speci_,fc2^ 
where there are k summands on the left. 

Recall that Sym(A) denotes the largest subgroup of Z such that A is the union of cosets of 
Sym(A). 

Lemma 5.2 (Kneser). If A,B C Z are additive sets in an ambient, finite abelian group Z, 
then we have the bound 

\A + B\ + |Sym(A + B)\ > \A\ + \B\ 

Iterating Kneser's inequality we have the following corollary. 

Corollary 5.3. Let Ai,...,Ak and B be additive sets in an ambient finite abelian group Z. 
Suppose we have the sumset inclusion 

Ai + --- + AkCB. 

Suppose further that B contains no additive cosets of Z . Then we have 

\B\ + {k-l) > \Ai\ + ■ ■ ■ + \Ak\ 

6. Sparse 

We would like to control the probability that is sparse. By definition, 

F{(f){We+i) is sparse) = P(0(W^+i) ± w non-zero with |suppw| < 6n) 

In particular, if we construct the n x [n — £) rectangular matrix 

B := [Xi^i ■ ■ ■ Xn\ 

then w^B = for some non-zero w with |suppw| < 6n. 

The coefficients of w are contained in a finite i?-submodule of R, since elements of R have 
finite order. We see that there is a maximal ideal m such that w mod m is not constant zero. 
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Reducing modulo this ideal, and possibly shrinking supp w, we conclude that w^B = modulo 
m. 

We can now finish the argument as in [7j. In particular, we can restrict B to those rows 
corresponding to non-zero entries of w. This gives a. \a\ x n — i matrix that is not of full rank. 
Regardless of the choice of spanning columns, the remaining columns must lie in their span 
and in particular be perpendicular to w. We recall the "classical" Littlewood-Offord theorem 
for finite fields from [7]. 

Lemma 6.1. Let X E be a random vector with iid entries taken from a probability distri- 
bution n with concentration constant a. Suppose if G has at least m non-zero coefficients. 
Then we have the estimate 

\F{X -w = r) -q-^\ < 



y/am 



for all r E¥g. 

Combining these estimates, we see that 



F(4>{Wi) is sparse) < ^ Q _ min(l -a,q ^ + 



\n-e-k+l 



'ok 



where q is the smallest residue field for a maximal ideal of R. If we choose 5 sufficiently small 
than this quantity is bounded by 0(e~'^""'). 



7. Unsaturated 

The singularity bounds of Tao and Vu in [10] , [TT] are based on swapping the columns of the 
matrix A with new columns drawn from a more singular probability distribution. Informally, 
if we have random vectors X and Y such that 

P(X eV)< cP(x G V) 

for some < c < 1 and all V in some class of vector spaces, then we can conclude that 

P(X2, span V) < c"-ip(r2, span V) 

modulo some difficulties with linear independence. If we would like to show that the columns 
of A span such V with small probability, it therefore suffices to use much worse bounds (such 
as the trivial bound) for the "swapped in" vectors Y . 

In [7] the author showed that this argument can also be used in the finite field setting as 
long as the vector spaces are not close to saturation for the random vectors. Explicitly, it was 
required that 

|P(X eV)- \V^\-'^\ > D\v^\-^ 

for some D (about 10). 

In our more general setting, the actual swapping lemma is a straightforward generalization 
of the swapping lemma from [7] . We will prove the following swapping lemma in [Hi 

Lemma 7.1. There exists a probability distribution u ^ R with concentration constant a/8 
with the following property. Suppose Y G R"' is a random vector with iid entries taken from v. 
Then for every submodule N <l i?" that is not sparse, we have the inequality 

|P(X eN)- \N^\-'^\ < Q + o{i)^ |P(F eN)- \N^\-\ 

We need a new notion for linear independence. In addition to avoiding linear dependencies, it 
is important that our vectors not introduce additional cokernel. We therefore say that Yi, 
are simply independent in N if Yi, ...,1^^ ^ N and i?"/(Yi, Y^) = R^~^ . We will abbreviate 
simply independent by s.a.. 
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As in [in] and [7], we require a dyadic decomposition on the magnitude of P(X G A^). We 
recall that the combinatorial codimension of is the unique fraction d± G n~^Z^ such that 

(1 - 0)^^*/"+^/" < P(X G A^) < (1 - a)'^±/". 

Since we have the trivial bounds 2"" < P(X G A^) < 1 we see that there are at most O(an^) 
possible combinatorial codimensions, so that it suffices to estimate 

P(0(IV^+i) unsaturated with d±{W) = d±) < ©(g-'^") 

for all d±. 

Let V-^ denote the isomorphism class of (j)(Wi+i)-^. Let A^ be an unsaturated submodules 
with A^-*- = V-^ of combinatorial codimension d±. Let Yi,...,Yr be iid copies of Y given by 
Lemma [7. II and let X[, ...,Xg be iid copies of X. Then we have 

FU(W...) = N) = n0(W^m)=iVAy„-.,y.,Xl,...,X>res.i. in AT) 
' P(Fi,...,F„A:i,...,X>re s.i. in AT) 

Because we have assumed that i?"/ (Fi, F^, ■■■,X's) — R^~^~'^ , we can replace r + s of the 
column vectors X^+i, X„ with Yi, ...,Yr, X[, X'^ with the following proposition. 

Proposition 7.2. Suppose = A^ and Zi,...,Zj are simply independent in N. Then 

there is a subset a C [£ + 1, n] with \a\ = r such that 

(P{{{Xk}kt.,Z^,...,Z,))=N. 

Furthermore, we can choose a so that if k & a, we have kei Hk+i — > Hk = R- 

Proof. It suffices to prove this proposition for r = 1, since the simple independence oi Zi, Zj 
prevents a subsequent choice of Xk from colliding with previously chosen vectors. 
From the definition of (p, we have 

t{Z -V)= Oi+iXi+i H h anXn 

for some v G IR"^ and coefficients G R. We can find k such that (cfc) = (t) as ideals in R. 
In particular, we have Ok = tu for some u G R^ , and we therefore write 

tXk = u'H{Z -v)- M~^a^+iX£+i M"^a„X„. 

Now, any y & N satisfies 

t(y - v') = be+iXi+i H h a^Xn 

but this equation can be rewritten to replace Xk with Z. □ 

Now it is convenient to abbreviate events. We define the events 

D„:=Y,,...,Yr,X[,...,X'^eN 

yi,...,y„X(,...,X; s.i. in AT 



En 



By Proposition 17. 2[ we have 

u ^ [^:-t- ± ,7tj 

\a\=r+s 

First we consider the ratio 



n{Xk}kea e N) 

We can expand the denominator with conditional expectation, 

¥{En) = FiDNMEN I Dm) 

= P(F G X)^P(X G NyF{EN I I^jv) 
For the numerator we need the following independence lemma. 
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Proposition 7.3. Let Zi, Zj be independent random vectors in R"^. Then we have the bound 

3 

P(Zi, ...,Zj&N\Zi,...,Zj are s.i.) < JJ P(Z^ G A^). 

1=1 

Proof. The proof is the same as in [7] except for minor notational differences. Expanding the 
left hand side with conditional expectation, 

r 

JJP(Zj- G iV I Zi, . . . , G N and Zi, Z^ are s.i.) 

i=i 

Let Uj denote the R span of Zi, . . . , Zj_i. We claim that 



¥{Z eN\ Uj) 



< F{Z G A^). 



In fact, 

F{Z eN\ Uj) = F{Z G Uj)¥{Z G iV \ Uj) + F{Z ^ f/j)P(^ G iV \ Uj) 
< ¥{Z G i Uj) + P(Z G iV \ t/,)P(^ ^ Uj) 

= F{Z G iV)P(Z ^Uj). □ 

Since {Xfcjfcgo- are simply independent in by construction, we therefore conclude that 

nWkea e iV) < P(X G N)'+\ 



We now apply Lemma 17.11 Since is unsaturated. 



P(F G A^) - V2 ^ 
Collecting terms, we see that so far we have shown that 



i+i) 



aC[e+l,n 
\cr\=r+s 



Next we control F{En \ Dn) from below. We can expand this probability into the product 

r 

JJP(Fi,...,Fj- s.c. in N \ Yi,...,Yj_i s.c. in A^ and Yi,...,i; G A^) 

along with analogous terms for X'. We can write each term of the product in the form 

- G (Fi, + mR"^ I Y, G N)). 

m 

However, we can bound this with Odlyzko's lemma and the condition on the combinatorial 
CO dimension of A^. 

To finish the argument, we now sum over all submodules A^ of prescribed isomorphism class. 
We then find that 

P(0(W^,+i) is unsat) < J] (1 + 1 + 0(1))^ J] P(F;v,.). 

CTC[^+l,n] N unsat 

\a\=r+s 

We see that the inner sum on the right hand side is always bounded by 1, since the collection 
of vectors can only induce a single submodule. We then choose r,s appropriately to bound the 
whole quantity by 0(e~'^""). □ 
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8. Swapping Lemma 
We will now prove a generalization of the swapping lemma from [10], [H], [7]. 
Proof of Lemma \7.1\ We will abbreviate 7 = 1/8 for convenience. Let 

Here := t). It is trivial to verify that u is a probability distribution and u > 1 — 27. 

It also has concentration constant /3 = 7a = a/8. 
The Fourier transform of v is 



D = 1 - 7 + 7|/i 



2 



It now suffices to verify the swapping inequality. We observe that 

n 

and similar for F{Y G A^). We therefore define 

n 



and 



i=i 



so that it suffices to show that 



tlJ&N^\{0} ^ ' -!/'eA'^\{0} 

We do this by level sets. For m > define 

F{u) := {i\)^N^\ f{i\)) > u} 

and 

G{u) := {ipeN^l giip) > u} 

and likewise let F'{u) = F{u) \ {0} and G'{u) = G{u) \ {0}. 

Let e > be chosen later. We must split the sum for / into two parts: those frequencies ip 
where f{ip) < e and those where fi^ip) > e. 

We first claim that f{ip) < gi^PY- This follows from the pointwise estimate < ^'(t)^, 

which we can prove with the arithmetic-geometric mean inequality: 

{\m?Y"<\{\r^? + ^) = y{t). 

This inequality quickly controls those frequencies where / is small. In fact, 

E f^Y)<e'" E 9{Y) 

so as long as e — )■ as n — )■ 00 this portion is done. 

Now we will consider those frequencies where / is large. In this case, we will apply Kneser's 
ineuality to the sumset inclusion F{u) + F{u) C G(u). 

It suffices to show that |/i(t)/i(s)| < z/(t + s)^. As in [7] we consider two cases. If either /i(t) < 
1 — 47 or < 1 — 47 then the inequality is trivial from the lower bound for u. Otherwise, 
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we write = 1 — 9i and = 1 — ^2- By Lemma EH] we have + s)| > 1 — 2(6*1 + ^2)- 

But by the definition of z/ we get u{t + s) = 1 — 7 + 7|/i(t + s)p > |/i(t)/i(s)|. 

Kneser's theorem tells us that 2|F(m)| < |SymF(M) + F{u)\ + \G{u)\. We would like to show 
that Sym(F(M) + F{u)) = {0}. It suffices to show that G{u) does not contain any non-trivial 
subgroup, as this would in turn guarantee that F{u) + F{u) does not either. 

Let H <l G{u) be minimal, so that H = Z/pZ for some prime p. Choose w G A^"*" that 
generates if as a Z/pZ-module; since is not sparse, we can assume that w contains at least 
6n non-zero entries. 



Define the function 



Averaging h over H, 



h{t) := 5^ 1 - d{t,f 



h{tw) = n — p^^ ujiwiY 

t&l-pL t<^1/p1 i=l 

By Plancherel's theorem and the fact that is not saturated we see that this entire quantity 
is bounded below by 'yaSn. Therefore we simply require that u < 'yaSn, which gives us the 
bound 

as required. □ 

9. Semi- SATURATED 
Let be a semi-saturated submodule. By the inverse Fourier transform, 



P(X G A^) ^ 



' ' ?eAf^\{0} (-=1 



In particular, we can find a C G A^"*" \ {0} such that 

1 " 

exp(-dra) < exp(--^^/'(C£)), 

1=1 

with ip{t) := 1 — taking logarithms gives 

n 

1=1 

Let K be a parameter to be chosen later. For all but k of we have 

2dn 



^(0) < 

K 

or, equivalently, Q G Spec^^^yU for e = 

Proposition 9.1. For all f3 > there exists e > such that for all orders T, Spec^.^/x contains 
at most (5T elements of order T . 

Proof. We first notice that there is an 77 > such that Speci_^/i does not contain any non- 
trivial additive cosets H + s with H <\ R and s G R. In fact, for each such H and s we can 
apply Markov's inequality and use the a-concentration of /i, 

(1 - r^)2#(if + s n Speci_,/x) < J2\l^{h + s)\'< \H\{1 - a) 

heH 
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We therefore choose rf — a/2. With this value, we apply the iterated form of Kneser's inequality 
to find 

k\{t eR\\t\ =r}nSpeCi_,//\{0}| < 10 e -R I \t\ =r}nSpeCi_fc2,//\{0}| 
so we pick k — P"^ and e = k^'^rj. □ 

We now choose e, and therefore d, such that SpeCj_g /i has small cardinality in every finite 
torsion submodule of R; namely /3T for order T elements. 

We can count directly the number of vectors ^ of given order with the above constraint. In 
fact, for order T we have 

#{structured C of order T} < (^^T^{^T)''-^ < /3"r" 

where in the last inequality we chose k, = n/10 and replaced (3 with a comparable value. 

We can count the number of submodules N with prescribed perpendicular isomorphism class 
and perpendicular to a fixed vector. 

Proposition 9.2. Let W-^ be a fixed finite submodule of i?" and let C £ -R" have finite order 
T. Then the number of submodules N < with N-^ = W-^ and C e N-^ is bounded by 

#{iv I C e iv^} < jxS^^m e I Id = T} 

Proof We double count the number of pairs ( e N-^, 

J] #{7V I C e N^} = #{(7V^, C) I C e N^, ICI -T} = J2nC^N^\ ICI = T} 

\C\=T N 

We observe that ^{N \ ( e N-^} is independent of C- In fact, for any two Ci, C2 of order T we 
can find an automorphism i?" — >■ i?" that maps (i to (2] this induces a correspondence between 
submodules of the same isomorphism class. We conclude that 

We have the trivial bound 

^{N I ^ W^} < 

so collecting terms we find 



lAutA^^r 



#{N I CeN^}< jx^^^m e I ICI = T} □ 

Now we are ready to estimate the probability that W is semi-saturated. We count the number 
of semi-saturated spaces. 



P(l^ is semi- saturated) < F(W = N) 

N semi-saturated 

f D \ 

< #{A^ semi-saturated | ^ W^} ( ) 

With the above bounds on the number of structured vectors and the number of submodules 
perpendicular to a given vector, we bound 

4{N I semi-sat and C, ^ struct., order T) < P^^-j — ^#{C e I ICI = T} 



Summing over every possible order T, 



\W 



#{7V I semi-sat and C e ^ struct.} < ' ^^^, 1^" 
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We now combine this with the bound on P(l^ = N) to find 



F(W semi-sat) < — T77n\W^\- 
^ '^^ lAutW^^l' ' 

By construction of W , its perpendicular module W-^ must contain at least I terms of maximal 
order, so 

\KntW^\ > \W^\^. 

We also know that \W-^\ < De"'' since W is semi-saturated, so if we take (3 (and therefore d) 
sufficiently small then we find 

F{W is semi-saturated) = 0(6"'=") 
as required. □ 
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